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Abstract. This paper is motivated by recent applications of Diophantine approximation 
in electronics, in particular, in the rapidly developing area of Interference Alignment. Some 
remarkable advances in this area give substantial credit to the fundamental Khintchine- 
Groshev Theorem and, in particular, to its far reaching generalisation for submanifolds of a 
Euclidean space. With a view towards the aforementioned applications, here we introduce 
and prove quantitative explicit generalisations of the Khintchine-Groshev Theorem for non¬ 
degenerate submanifolds of R n . The importance of such quantitative statements is explicitly 
discussed in Jafar’s monograph [131 §4.7.1]. 
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1. Introduction 

The present paper is motivated by a recent series of publications, including mm 
mmmmmmm, which utilize the theory of metric Diophantine approxima¬ 
tion to develop new approaches in interference alignment, a concept within the held 
of wireless communication networks. This new link is both surprising and striking. 
The key ingredient from the number theoretic side is the fundamental Khintchine- 
Groshev Theorem and its variations. In this paper we seek to address certain problems 
in Diophantine approximation which crop up, or impinge upon, the applications to 
interference alignment. The results obtained represent quantitative refinements of the 
Khintchine-Groshev Theorem that are relevant to the applications mentioned above. 
Indeed, the desirability of such quantitative statements is explicitly eluded to in Jafar’s 
monograph [131 §4.7.1]. 

Although the main emphasis will be on the Khintchine-Groshev Theorem for sub¬ 
manifolds of M n , we begin by considering the classical theory for systems of linear 
forms of independent variables. This approach has two benefits. Firstly, we are able to 
introduce the key ideas without too much technical machinery obscuring the picture. 
Secondly, the refinements of the classical theory produce effective results with much 
better constants. 

In order to recall Khintchine’s theorem we first define the set of if-well ap¬ 

proximate numbers. To this end, denote by M + the set of non-negative real numbers. 
Given a real positive function : M + —> R + with if(r) —> 0 as r —>■ oo, let then 

W(f>) := {i 6 K : \qx — p\ < 'if(q) for i.m. ( q,p ) eNxZ}, 

where ‘i.m.’ reads ‘infinitely many’. For obvious reasons the function if is often referred 
to as an approximating function. The points x in W(if) are characterized by the 
property that they admit approximation by rational points p/q with the error at most 
if(q)/q. 

A simple ‘volume’ argument together with the Borel-Cantelli Lemma from probabi¬ 
lity theory implies that 


|W(^)| = 0 if £ if{q ) < oo , 

9=1 

where |X| stands for the Lebesgue measure of X C M. The above convergence statement 
represents the easier part of the following beautiful result due to Khintchine which gives 
a criterion for the size of the set W(if) in terms of Lebesgue measure. In what follows, 
we say that X C t is full in M and write |A"| = FULL if |M \ X\ = 0; that is, the 
complement of X in M is of Lebesgue measure zero. The following is a slightly more 
general version of Khintchine, see [4j. 
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Theorem A (Khintchine, 1924). Let if be an approximating function. Then 


IMVOI 


o if 

< 

Full if 


oo 

< oo, 

5=1 

OO 

^^if{q) = 00 an d if is monotonic. 

5=1 


Thus, given any monotonic approximating function -0, for almost alfl x G M the 
inequality \x — p/q\ < if {q)/q holds for infinitely many rational numbers p/qii and only 
if the sum Yl^Li V’(?) diverges. 

There are various generalisations of Khintchine’s theorem to higher dimensions - 
see [3] for an overview. Here we shall consider the case of systems of linear forms which 
originates from a paper by Groshev in 1938. In what follows, m and n will denote 
positive integers and M m;n will stand for the set ofmxn matrices over M. Given a 
function T : TT —\ M + , let 

:= {X = (x id ) E M mn : ||Xa|| < T(a) for i.m. agZ"\ { 0 }} , 
where a = (ai,..., a n ), 

||Xa|| := max \\xi^a\ + ... + x itn a n \\ 

and ||a:|| := min{|x — k\ : k e Z} is the distance of x G M from the nearest integer. 
Given a subset A" in M mjn , we will write |A"| mn for its ambient (i.e. mn-dimensional) 
Lebesgue measure. It is easily seen that Wi x (^/) coincides with W(0) when T(g) = 
'ifdql). Therefore the following result is the natural extension of Theorem lAl to higher 
dimensions. Notice that there is no monotonicity assumption on the approximating 
function. 


Theorem B. Let m,n E N with nm > 1, if : N —» M + be an approximating function 
and 

OO 

; =Xb""V(?r- (i) 

5=1 

Let T : 7L n —>■ M + be given by T(a) := -0(|a|) for a = (cp,..., a n ) 6 Z"\ {0}, where 
|a| = maxi<j<„ |aj|. Then 


| W m ,n(^)|mn 


0 if < oo, 

Full if = oo. 


Theorem B was first obtained by Groshev under the assumption that q n if( K q) m is 
monotonic in the case of divergence. The redundancy of the monotonicity condition 
for n > 3 follows from Schmidt’s paper m Theorem 2] and for n — 1 from Gallagher’s 
paper ra- Theorem B as stated was eventually proved in [6] where the remaining case 

1 ‘For almost all’ means for all except from a set of Lebesgue measure zero. 
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of n — 2 was addressed. The convergence case of Theorem B is a relatively simple 
application of the Borel-Cantclli Lemma and it holds for arbitrary functions T. Thus 
together with Theorem A, we have the following extremely general statement in the 
case of convergence. 


Theorem C. Let m,n £ N and T : TT —» M + be any function such that the sum 


converges. Then 


E* := ^2 ^( a ) 

aez n \{0} 


|W m , n (^)| mn = 0. 


( 2 ) 


An immediate consequence of Theorem ICl is the following statement. 

Corollary 1. Let T be as in Theorem E3 Then, for almost every X £ M m ^ n there exists 
a constant a(X) > 0 such that 

||Xa|| > R(X)T(a) V a £ Z n \ {0} . (3) 


In recent years estimates of this kind have become an important ingredient in the 
study of the achievable number of degrees of freedom in various schemes on Interference 
Alignment from electronics communication — see, e.g., (El. The applications typically 
require that k(X) is independent of X. Unfortunately, this is impossible to guarantee 
with probability 1, that is on a set of full Lebesgue measure. To demonstrate this claim, 
let us define the following set : 

k) := {x £ M m , n : ||Xa|| > /ctf (a) V a £ Z n \ {0}} . (4) 

Then, for any n and T, the set £h, n (T, n) will not contain 

[-AT(a),ftT(a)] x M n_1 

with a = (1, 0,..., 0). This set is of positive probability. In the light of this example it 
becomes highly desirable to address the following problem : 

Problem. Investigate the dependence between k and the probability of 

As the first step to understanding this problem we obtain the following straightfor¬ 
ward consequence of Theorem C. 

Theorem 1. Let m,n £ N and p be a probability measure on M m , n that is absolutely 
continuous with respect to Lebesgue measure on M mtn . Let T : Z n —>- M + be any function 
such that (|2D converges. Then for any 5 £ (0,1) there is a constant k > 0 depending 
only on p, T and 5 such that 


p ( B m , n { « 0 ) > 1 - S. 


( 5 ) 
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Prior to giving a proof of this theorem recall that a measure /i on M m ^ n is absolutely 
continuous with respect to Lebesgue measure if there exists a Lebesgue integrable func¬ 
tion / : M mt n —> M + such that for every Lebesgue measurable subset A of M m _ n , one 
has that 

= f f, (6) 

J A 

where f A f is the Lebesgue integral of / over A. The function / is often referred to as 
the distribution (or density) of p. 

Proof. Since /i is absolutely continuous with respect to Lebesgue measure, Theorem C 
implies that /x(VP m , n (^ r )) = 0. Hence n(M m>n \ W mjn ('L)) = fjf M„ hn ) = 1. Note that 

U «) = M mt n \ . 

re >0 

Theorem [T] now follows on using the continuity of measures. □ 

In view of our previous discussion we have that k —> 0 as 6 —» 0. Then, the above 
problem specialises to the explicit understanding of the dependence of k on S. This 
will be the main content of the next section. Subsequent sections will be devoted 
to obtaining similar effective version of the convergence Khintchine-Groshev Theorem 
for non-degenerate submanifolds of M n . This constitutes the main substance of the 
paper. The results are obtained by exploiting the techniques of Bernik, Kleinbock and 
Margulis [8j originating from the seminal work of Kleinbock and Margulis [IT] on the 
Baker-Sprindzuk conjecture. 

2. The theory for independent variables 

To begin with we give an alternative proof of Theorem [T] which introduces an explicit 
construction that will be utilized for quantifying the dependence of k on 6. Indeed, in 
the case that fi is a uniform distribution on a unit cube the proof already identifies the 
required dependence. 

2.1. Theorem [I] revisited. By a unit cube in M m n we will mean a subset of M rn n 
given by 

{ ) £ Atm,n • ® i,j ^ ® i,j T 1} 

for some hxed matrix (aiij) G M rn;n . Given aeZ"\ {0} and £ > 0, let W(a, e) denote 
the set of X G M TOj „ such that 

||Xa|| <e. (7) 

It is easily seen that W(a, e) is invariant under additive translations by an integer 
matrix; that is, 

W(a,e) + B = W(a, e) 

for any B G M TOiTI (Z), where M m)ri (Z) denotes the set ofrnxn matrices with integer 
entries. Furthermore, we have that 

|bV(a, e) n P\ mn = (2£) m 


( 8 ) 
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for any 0 < e < | and any unit cube P in M mn . This follows, for example, from |20, 
Chapter 1, Lemma 8]. Then, since 

£* := ^ T(a) m < oo (9) 

aez n \{0} 


we must have that 

Mj, := sup{T(a) : a G Z n \ {0}} < oo. (10) 

In what follows we will assume that 


2 < 1 . ( 11 ) 
This condition ensures that we can apply ((HI) with e = /AD (a). 

Fix a unit cube P 0 in M m n and for each A e M m>n { Z), let 

Pa Cq + A 

denote the additive translation of Pq by A. Clearly, Pa itself is a unit cube. Further¬ 
more, 


M m , n = (J Pa- (12) 

Note that the union is disjoint. Using (JHJ) and the fact that 

M mjn \ k) = (J W(a, rT( a)), 

aez n \{0} 

we obtain that for each A 6 M m>n { Z), 

|-Pa \Sm,n(^,K)Un < ^ |W(a, /ctf(a)) n P A \mn 

aez n \{0} 


= ^ ( 2 «^( a )) m = (2<PS*. (13) 

aez n \{0} 

Since // is a probability measure, it follows from (1121) that there exists a finite subset 
A C M m>ri (Z) such that 

U U P a) >1-V2- (14) 

\AeA / 

Let N = be the number of elements in A. Since /i is absolutely continuous with 

respect to Lebesgue measure, for every A e A and any e\ > 0, there exists £2 such that 

for any measurable subset X of P&, 

|AT|mra < £2 = ^ > fl(X') < £\ . (15) 

I 11 view of (US]) , applying (fT51) to X = Pa \ H m , n (^,K) and £1 = 5/(2A) implies the 
existence of 


e 2 = £ 2 (A,5, N) > 0 
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such that 

fj,(PA\B m , n (V, K ))<8/(2N) if (2«) m £* <£ 2 (A,S,N). (16) 

In particular, the second inequality in (1T6l) holds if 

1 fe 2 (A,6,N)Y /m 

K - ;= 2 ( ) ' 

Since A is finite, there exists k satisfying (ITT]) and 

0 < hi < min ka • 

AeA 

Clearly, for such a choice of k the first inequality in (fT6j) holds for any A G A Hence, 
by m an d the additivity of // we obtain that 

£ 

\ S m , n ($, K)) < - + ^ p(P A \ K)) 

AeA 

5^5 5 Ar 6 

< -+> — = - + N — = 5. 

~ 2 ^ 2N 2 2 N 

AeA 

The upshot of this is that 

/i(H m , n (T, k)) = 1 - n{M m , n \ H,n,n(T, k)) > 1 - 5 , (17) 

which completes the proof of Theorem [0 


2.2. Quantifying the dependence of k on 5 . We now turn our attention to quan¬ 
tifying the dependence of k on <5 within the context of Theorem CD To this end, we will 
make use of the L p norm. Given a Lebesgue measurable function / : M m)n —y M + , a 
measurable subset X of M m n and p > 1, we write / G L p (X) if the Lebesgue integral 



exists and is finite. Here Xx is the characteristic function of A. For / G L p (X), the L p 
norm of / on X is defined by 



In the case p = oo, the Loo-norm on X is defined as the essential maximum of |/| on 
A"; that is, 

||/||oo,x : = inf {c G M : \f{x)\ < c for almost all x G A"} . 

If ||/||oo,x < cxd, then we write / G L 0O (A). For example, if / is continuous and A" is a 
non-empty open subset of M m , n , then ||/|| oo , a : is simply the supremum of / on X . The 
following lemma gathers together two well know facts regarding the L p norm. 


Lemma 1. 
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(1) For any p > 1 and any measurable subsets X C Y, 


p,X < 


p,Y 


(2) (Holder’s inequality) For any 1 < p, q < oo satisfying + I 




1, 


The next lemma is a corollary of Lemma [0 

Lemma 2. Let p > 1 and p be a probability measure on M m ^ n with density f. Let X 
be a Lebesgue measurable subset of M m;n . If f e L P (X), then 

MX) < II/IUIXIL 1/P . 

Proof. By definition, we have that 

hP0= [ f. 

J x 

Define q by the equation I + I — 1. Then by Holder’s inequality, we have that 

MX) = £/x 1 < ||/|| p ,x||i|U = ll/luQU’) /S < ll/IUIxi™„ 1/p 

as required. □ 

We are now in the position to provide an effective version of Theorem [D Let Po and 
A be the same as in 1 12.11 In particular, assume that (Tl4l) holds. Furthermore, assume 
that there exists some p > 1 such that for every A £ A, the density / of p has finite 
L p norm on Pa • 

Let hi be such that m is satisfied. In this case, (1T31) holds for every Pa with A 6 A. 
By Lemmas [□ and El 

MPa \ k)) < ||/|| rFa ■ |Pa \ Ml™ 7 ” • 

Using (1T31) . we obtain that 

/ \ i-i /p 

f(Pa \ k)) < \\f\\ p ,p A • ((2 K ) m £*J 

where is given by (EJ). It follows that 

£ 

p(M m ^ n \B mpx (^ ,n)) < - + ^ p(P A \ k )) 

AeA 

S / \ i-i/p 

< -+((2k)’“E* j E, < S 


(18) 
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if 


where 


f p \ 1/m 

(A A 

* V 2E ( 






p,Pa 


(19) 


AeA 


Since A is finite, the quantity Ej is also finite. The upshot of the above discussion is 
the following statement. 

Theorem 2 (Effective version of Theorem [I]). Let m, n G N, p, T be as in Theorem [I], 
let My be given by (ITOT) and let f denote the density of p. Furthermore, let P 0 be any 
unit cube in M m n and A be any finite subset of M mn (fL) satisfying (1141) . Assume there 
exists p > 1 such that f G L p (P/fi) for any A G A and also assume that the quantity Ey 
is given by (TTUll . Then, for any 5 G (0,1), inequality (J5J) holds with 


k — nun 
2 


i-i 


Mvt 


2E i 


P 

P-1 


1/m ' 


( 20 ) 


In this formula, the quotient p/(p — 1) should be taken as equal to 1 when p = oo. 

Remark 1. In the case when T is even, that is T(—a) = T(a) for all a e Z n \ {0}, 
one can improve formula (|20l) for k by replacing E^ with |E^. This is an obvious 
consequence of the fact that in this case the sets W(a,/c\P(a)) and W(—a, rT(— a)) 
coincide and therefore do not have to be counted twice within the proof. 

There are various simplifications and specialisations of Theorem [21 when we have 
extra information regarding the measure p. The following is a natural corollary which 
is particularly relevant for probability measures p with bounded distribution / and 
mean value about the origin. 

Corollary 2. Let m,n G N, p, T, My be as in Theorem\T\ and let the density f of 
p be bounded above by a constant K > 0. Furthermore, let T be the smallest positive 
integer such that 

p([-T,T) mn ) > 1 -5/2. (21) 

Then, for any 5 G (0,1) , inequality © holds with 


k — nun 
2 


My ’ \2K(2Ty 


l/m ' 


( 22 ) 


Proof. With respect to Theorem [2l let p — oo and A be the collection of cubes Pa that 
exactly tiles [—T,T) mn . Then, #A = (2T) mn and thus Ej < K(2T) mn . Now, (122|) 
trivially follows from (|20|) . □ 
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2.3. Numerical examples. In what follows, we will use the standard Gaussian error 
function 


erf(x) := 


-t- 


'/ 2 d t. 


V 27T J —oo 

It is readily verified that the function erf is continuous, strictly increasing and that 

lim erf(x) = 0 and lim erf (a;) = 1. 

X—y— oo x —>-+oo 

As usual, for 0 < y < 1, define erf _1 (i/) to be the unique value x G M such that 
erf(x) = y. Furthermore, define formally erf _1 (0) := — oo and erf _1 (l) := +oo. 

Consider now Corollary [2] in the case when m — n — 1 and when /i follows the 
standard Gaussian distribution A/"(0,1). It can then be verified that Corollary [2] implies 
that inequality ([5]) holds with 

-AS;' 

where N := [erf -1 (1 — 5/4)]. Here [a;] is the “ceiling” of x, that is the smallest 
integer that is bigger than or equal to x G M. We now consider explicit approximating 
functions. First, let T be the function given by T(g) = 0 if q < 0, 

4/(g) := -- 2 if q > 2 and 41(1):= 1/2. 


2 q ■ log q 


Then < 1.555 and on making use of 
of N and k : 


we obtain the following table for values 


s 

0.5 

0.25 

0.1 

0.01 

o 

1 

w 

10~ b 

N 

2 

2 

3 

4 

4 

5 

hi 

0.05 

0.025 

0.0067 

5•lO" 4 

5•10" 5 

1— 1 
o 

1 


It follows for instance from this set of data that for 99% of the values of the random 
variable x with normal distribution Af(0, 1), one has that 

1 


||gx|| > 


2000 


• T(g) for all q E N. 


y(q) ■= { ® 


In the next example, we fix a Q G N and consider the approximating function T 
given by 

1 if 1 <q<Q, 

otherwise. 

Then = 1 and one can readily verify that 

(i) for at least 75% of the values of the random variable x with normal distribution 
A/"(0,1), one has that 

1 


\\qx\\ > 


13 Q 


for all q G [-Q, Q], q % 0, 
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(ii) for at least 90% of the values of the random variable x with normal distribution 
Af( 0,1), one has that 


IMI > ^ for all q G [-Q, Q], q % 0. 


3. Diophantine approximation on manifolds 


The aim is to establish an analogue of Theorem 0 for submanifolds M. of K n . More 
precisely, we consider the set £>„(\l/, k) flAf, where 


: = B 1 iB (*,k) . 


The fact that the points of interest are of dependent variables, reflecting the fact that 
they lie on A4, introduces major difficulties in attempting to describe the measure 
theoretic structure of £> n ( l I J , k) D AT 

Non-degenerate manifolds. In order to make any reasonable progress with the above 
problems it is not unreasonable to assume that the manifolds M. under consideration are 
non-degenerate. Essentially, these are smooth submanifolds of which are sufficiently 
curved so as to deviate from any hyperplane. Formally, a manifold Af of dimension 
d embedded in W 1 is said to be non-degenerate if it arises from a non-degenerate 
map f : U —» M n where U is an open subset of and M. := f(U). The map 
f : U —* M n ,x i— y f(x) = (/i(x),..., f n (x)) is said to be l-non-degenerate at x G U, 
where l G N, if f is l times continuously differentiable on some sufficiently small ball 
centred at x and the partial derivatives of f at x of orders up to l span M n . The map 
f is non-degenerate at x if it is /-non-degenerate at x for some I G N. As is well 
known, any real connected analytic manifold not contained in any hyperplane of M n is 
non-degenerate at every point [T3] , 

Observe that if the dimension of the manifold A4 is strictly less than n then we 
have that k) fl Al| n = 0 irrespective of the approximating function T and k. 

Thus, when referring to the Lebesgue measure of the set £> n (T, n) fl Ai it is always with 
reference to the induced Lebesgue measure on AT More generally, given a subset S 
of M we shall write |Sjv'( for the measure of S with respect to the induced Lebesgue 
measure on Ah Without loss of generality, we will assume that |Al|x = 1 as otherwise 
the measure can be re-normalized accordingly. 

The following statement is a straightforward consequence of the main result of Bernik, 
Kleinbock and Margulis in [8]. 

Theorem BKM. Let A t be a non-degenerate submanifold of R n . Let T : Z n —* M + 
be monotonically decreasing in each variable and such that 



T(q) < oo . 


(24) 


qez™\{0} 


Then, for any 5 G (0,1), there is a constant k > 0 depending on AT and 5 only 
such that 
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B n (\ b, k) ft A4\m > 1 — 5. 


( 25 ) 


Remark 2. Theorem BKM holds for arbitrary probability measures supported on Ai 
that are absolutely continuous with respect to the induced Lebesgue measure on Ai, 
thus giving an analogue of Theorem [2] for manifolds. As in the case of Theorem [U the 
more general result follows from the Lebesgue statement. 


It is worth pointing out that the main result in [8j actually implies that the union 
IUA(* , k,) fl Ai has full measure on Ai. Theorem BKM as stated above follows 
from [8j Theorem l.lj^on using the continuity of measures. Our main goal is to quantify 
the dependence of n on 5. Theorem [6] of $6] below explicitly quantifies this dependence. 
However, the statement is rather technical and we prefer to state for now a cleaner 
result that shows that the dependency between k and 5 is polynomial. 


Theorem 3. Let l G N and let Ad be a compact d-dimensional C l+l submanifold of 
R" that is l-non-degenerate at every point. Let p be a probability measure supported 
on A4 absolutely continuous with respect to \-\m- Let T : Z n —y R + be a monotonically 
decreasing function in each variable satisfying (1211) . Then there exist positive constants 
hi 0 , Cq, Ci depending on T and Ai only such that for any 0 < 5 < l, the inequality 


(jt{B n {V,K) n Ai) >1-5 

holds with 

ft — min {(to, CoEiU f.'ii'"" 1 ’' 2 ' ’} . 


(26) 

(27) 


4. Diophantine approximation on manifolds and wireless technology 


In short, interference alignment is a linear precoding technique that attempts to align 
signals in time, frequency, or space. The following exposition is an attempt to illustrate 
at a basic level the role of Diophantine approximation in implementing this technique. 
We stress that this section is not meant for the “electronics” experts. We consider 
two examples. The first basic example brings into play the theory of Diophantine 
approximation while the second slightly more complicated example also brings into 
play the manifold theory. 

EXAMPLE 1. There are two people (users) Si and S 2 who wish to send (transmit) a 
message (signal) u G {0,1} and v G {0,1} respectively along a single communication 
channel (could be a cable or radio channel) to a person (receiver) R. Suppose there 
is a certain degree of fading (channel coefficients) associated with the messages during 
transmission along the channel. This for instance could be dependent on the distance 
of the users to the receiver and in the case of a radio channel, the reflection caused 
by obstacles such as buildings in the path of the signal. It is worth stressing that 
this aspect of “fading” associated with a signal should not be confused with the more 

^Throughout, results and page numbers within [8] are with reference to the arXiv version: math/0210298vl 
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familiar aspect of a signal being corrupted by “noise” that will be discussed a little 
later. Let hi and h 2 denote the fading factors associated with the messages being sent 
by Si and S 2 respectively. These are strictly positive numbers and assume their sum 
is one. Also, assume that the channel is additive. That is to say that R receives the 
message: 

y = h\X\ + h 2 x 2 where X\ — u and x 2 = v. (28) 

Specifically, the outcomes of y are 


'0 

hi 

h 2 

k 1 — h\ + h 2 


if 

if 

if 

if 


u = v = 0 
u — 0 and v — \ 
u = 1 and v = 0 
u = v = 1, 


(29) 


and if hi ^ h 2 , the receiver is obviously able to recover the messages u and v. Moreover, 
the greater the mutual separation of the above four outcomes in the unit interval 
/ = [0,1], the better the tolerance for error (noise) during the transmission of the signal. 
The noise can be a combinations of various factors but often the largest contributing 
factor is the interference caused by other communication channels. If z denotes the 
noise, then instead of (|28]h in practice R receives the message: 

y = hiXi + h 2 x 2 + z where x\ — u and x 2 = v. (30) 


Now let d denote the minimum distance between the four outcomes of y e / which 
are explicitly given by (J29]). Then as long as the absolute value \z\ of the noise is strictly 
less than d/ 2, the receiver is able to recover the messages u and v. This is simply due 
to the fact that intervals of radius d/2 centered at the four outcomes of y are disjoint. 
In this basic example, it is easy to see that the maximum separation between the four 
outcomes is attained when h\ = 1/3 and h 2 = 2/3. In this case d — 1/3, and we are able 
to recover the messages u and v as long as \z\ < 1/6. The upshot is that the closer the 
real numbers hi and h 2 are to 1/3 and 2/3 the better the tolerance for noise. Hence, 
at the most fundamental level we are interested in the simultaneous approximation 
property of real numbers by rational numbers. In practice, it is the probabilistic aspect 
of the approximation property that is important - knowing that the numbers hi and h 2 
lie within a ‘desirable’ neighbourhood of the points 1/3 and 2/3 with reasonably high 
probability is key. This naturally brings into play the theory of metric Diophantine 
approximation. 

hi h 2 

H-)-(— hH -{—I 

1 2 i 

3 3 

Note that from a probabilistic point of view, the chances that hi = h 2 is zero and 
is therefore insignificant. Furthermore, within the context of this basic example, by 
weighting ( precoding ) the messages u and v appropriately before the transmission stage 
it is possible to ensure optimal separation (d — 1/3) at the receiver regardless of the 


z 

P4 
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values of h\ and h 2 . Indeed, suppose x\ = \h^ l u and yi = are transmitted 

instead of u and v. Then, without taking noise into consideration, R receives the 
message 

y = h v x\ + h 2 x 2 — \u+ |u (31) 

and so the specihcs outcomes are 


0 

if 

u = 

= V 

= 0 

1/3 

if 

u = 

= 0 

and v 

2/3 

if 

u = 

= 1 

and v 

1 

if 

u = 

= V 

= 1 . 


(32) 


EXAMPLE 2. There are two users Si and S 2 as before but this time there are also 
two receivers R.\ and R 2 . Suppose Si wishes to simultaneously transmit independent 
signals u\ and v\ as a single signal, say x\ = u\ + v\ where u\ is intended for R\ and 
V\ for R 2 . Similarly, suppose S 2 wishes to simultaneously transmit independent signals 
u 2 and v 2 as a single signal, say x 2 = u 2 + v 2 where u 2 is intended for Ri and v 2 for 
R 2 . As in the hrst example, for the sake of simplicity, we can assume that the signals 
u\,u 2 ,vi,v 2 G {0,1}. Now let hu and h 2 \ denote the channel coefficients associated 
with signals being sent by S\ to R\ and R 2 respectively. Similarly, let h 12 and h 22 
denote the channel coefficients associated with signals being sent by S 2 to R\ and R 2 . 
Assume that the channel is additive and let ij\ (respectively y 2 ) denote the signal at 
receiver Ri (respectively R 2 ). Thus, 


y 1 = huxi + h l2 x 2 


(33) 


where 


y 2 = h 2l xi + h 22 x 2 


X\ = Ui + V\ and x 2 = u 2 + v 2 . 


(34) 

(35) 


u i,Vi 


Si 


S2 



U 2 : v 2 
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. Recall, that Ri (respectively R- 2 ) only cares about recovering the signals u i and u 2 (re¬ 
spectively Vi and v 2 ) from y 1 (respectively y 2 ). For the moment, let us just concentrate 
on the signal received by R,\ ; namely 

yi = huUi + hi 2 u 2 + hnVi + h 12 v 2 . 

It is easily seen that this corresponds to a received signal in Example 1 modified to 
incorporate four users and one receiver. This time there are potentially 16 different 
outcomes. In short, the more users, the more outcomes and therefore the smaller the 
mutual separation between them and in turn the smaller the tolerance for noise. Now 
there is one aspect of the setup in this example that we have not yet exploited. The 

receiver Ri is not interested in the signals V\ and v 2 . So if they could be deliberately 

aligned via precoding into a single component V\ +v 2 , then yi would look like a received 
signal associated with just 3 users rather than 4. With this in mind, suppose instead 
of transmitting x\ and x 2 given by (1551) . S\ and S 2 transmit the signals 

x\ = h 22 ui + h l2 v\ and x 2 = h 2l u 2 + huV 2 (36) 

respectively. Then, it can be veriEed that the received signals given by (1441) and (1541) 
can be written as 

Vi — {hi\h 22 )ui + {h 2 \h\ 2 )u 2 + (/in/ii 2 )(ui + v 2 ) 

V 2 — ( h 2 ihi 2 )vi + ( huh 22 )v 2 + (h 2 \h 22 )(ui + u 2 ). 

In other words, the unwanted, interfering signals at either receiver are aligned to a one 
dimensional subspace of four dimensional space. Notice that in the above equations 
the six coefficients are only of four variables, namely i,j = 1, 2, and thus represent 
dependent quantities. This, together with our findings from Example 1, naturally 
brings into play the manifold theory of metric Diophantine approximation. 

Example 2 is a simplified version of Example 3 appearing in [15], §111]. For a deeper 
and more practical understanding of the link between interference alignment and metric 
Diophantine approximation on manifolds the reader is urged to look at [15] and [T51 
§4.7], 


5. Preliminaries for Theorem [5] 

5.1. Localisation and parameterisation. Since A4 is non-degenerate everywhere, 
we can restrict ourself to considering a sufficiently small neighbourhood of an arbitrary 
point on A4. By compactness, A4 then can be covered with a finite subcollection of 
such neighbourhoods. Therefore, in view of the finiteness of the cover, the existence 
of k 0 , Co and C\ satisfying Theorem [5] globally will follow from the existence of these 
parameters for every neighbourhood in the finite cover : kq, Co and C\ should be taken 
to be the minimum of their local values. 

Now as we can work with A4 locally, we can parameterize it with some map f : U — y 
K n defined on a ball U in W 1 , where d = dimAL Note that f must be at least C 2 in 
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order to ensure that M. is non-degenerate. Without loss of generality we assume that 

M = {f (x) : x G U} . 

Furthermore, using the Implicit Function Theorem if necessary, we can make f to be 
a Monge parametrisation, that is f(x) = (aq,..., Xd, /d+i(x),..., / n (x)), where x = 
(xi,..., Xd)- Note that f can be assumed to be bi-Lipschitz on U. This readily follows 
from the fact that f is C 1 but possibly requires a further shrinking of U. 

Let £> n (T,ft, M) denote the orthogonal projection of B n (^, k) fl Ml onto the set of 
parameters U. Thus, 

a, M) := {x G U : ||a.f(x)|| > k^( a) for all a G Z n , a^O}. (37) 

The set £> n (T, n) Cl M. and its projection £> n (\l/, k, A4) are related by the bi-Lipschitz 
map f. Since bi-Lipschitz maps only affect the Lebesgue measure of a set by a multi¬ 
plicative constant (in this case the constant will depend on f only), it suffices to prove 
Theorem [3] for the project set. More precisely, Theorem [3] is equivalent to showing that 
there exist positive constants k 0 , Co and C\ > 0 depending on T and f only such that 
for any 0 < 5 < 1, 

\B n (*, K ,M)\ d > (l-5)|U| d (38) 

holds with k given by (|2Tj) . 

5.2. Auxiliary statements. We will denote the standard L\ (resp. Euclidean, infini¬ 
ty) norm on W l by || . || x (resp. || . || 2 , || . || ). Also as before, given an iGl, ||x|| will 
denote the distance of x from the nearest integer. The notation B(x, r) will refer to the 
Euclidean open ball of radius r > 0 centered at x and will denote the unit sphere 
in dimension d > 1 (with respect to the Euclidean norm). Furthermore, throughout 


' r (1 + d/2) 

is the volume of the d-dimensional unit ball and Nj denotes the Besicovitch covering 
constant. 

Remark 3. For further details on the Besicovitch covering constant, cf. [9j. We will 
only need in what follows the inequality Nd < 5 d satisfied by this constant. 

The proof of Theorem [3] involves two separate cases that take into consideration 
the relative size of the gradient of f(x) • q, where q = (q\,... ,q n ) G U 1 \ {0} and 
f(x) • q = + ■ ■ ■ + f n (-x)q n is the standard inner product of f(x) and q. The 

first case of ‘big gradient’ is considered within the next result and is an adaptation of 
[8j Theorem 1.3]. 

In what follows, dp will denote partial derivation with respect to a multi-index 
/3 = (/3i,... , /3d) G Hq, where N 0 will stand for the set of non-negative integers, that 
is No := {0,1,2,... }. Furthermore, |/3| we will mean the order of derivation, that is 
|/31 := f3i + ■■■ + (3d- Also, d/ will denote the differential operator corresponding to the 
k th derivative with respect to the i th variable, that is, d/ := d k /dx\. 
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Theorem 4. Let U C M. d be a ball of radius r and f G C 2 (2U), where 2U is the ball 
with the same centre as U and radius 2 r. Let 


L* := sup ||fyf(x)|| c 

|/3|=2, xS2U 


and L := max ( L*, 


4t"2 


Then, for every 5' > 0 and every q 6 Z“\ {0} ; the set o/x G U suc/i that 

l|f(x) - q|| < 

and 

IIVf(x)q|| 0O > v'ndLUqlloo 

has measure at most K d 5'\XJ\d, where Vf(x)q is the gradient o/f(x) • q and 

4 2d+1 d d / 2 N d 


K a := 


Ki 


is a constant depending on d only. 


(39) 


(40) 

(41) 


Proof. The proof of Theorem [4] follows on appropriately applying [8| Lemma 2.2], For 
convenience we refer to this lemma as L2.2. We take M in L2.2 to be equal to the 
quantity ndL, where L is defined by (l39j) . We set 5 in L2.2 to be equal to 5' appearing 
in Theorem |4j Then, in view of (139|) and the fact that n, d, ||q||oo > 1, it follows 
that the hypotheses of L2.2 (namely (8^, Eq(2.1a) & Eq(2.1b)]) are satished. Thus, 
the conclusion of L2.2 implies Theorem [4] with the constant C d in L2.2 equal to Kd 
appearing in Theorem [4] The explicit value of Kd is calculated by ‘tracking’ the values 
of the auxiliary constants C' d , C'f and Cf appearing in [8]. NameljH, 


and then 



V d 

2 2^/2’ 


^ 4 // r ) d -\-2 


/-ill I 

°d 


c: 

cy 


K d = 2 d Cf'N d 


2 d C'fN d 

C'd 


2 M+2 d d/2 Nd 

W 


□ 


Next in Theorem [5] below we consider the case of ‘small gradient’. This is an explicit 
version of p3j Theorem 1.4]. First we introduce auxiliary constants. 

Given a C l map f : U — y R n defined on a ball U in M. d , the supremum of s G R such 
that for any x G U and any v G S n_1 there exists an integer k, 0 < k < l, and a unit 


o 

There are two typos in the proof of L2.2 that one should be aware of when verifying the values of the 
constants given here. On page 6 line -2, the inclusion regarding U(x) is the wrong way round, it should read 
U(x) D B(x, jtj). Next, on page 7 line 11, in the rightmost term of the displayed set of inequalities the 
quantity S is missing, it should read C'f S\U(x)\d- These typos do not affect the validity of the proof given 

in 0. 
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vector u G 1 satisfying 


d k {i ■ v) 
du 1 ' 



> s 


( 42 ) 


will be called the measure of l-non-degeneracy of (f, U) and will be denoted by s(Z; f, U). 
Here and elsewhere for a unit vector u G § d_1 , d k /du k will denote the derivative in 
direction u of order k. 


As in Theorem QJ the radius of the ball U will be denoted by r. Throughout, we 
let x 0 denote the centre of U. Also, given a real number A > 0, we let AU denote the 
scaled ball of radius A r and with the same centre x 0 as that of U. With this in mind, 
consider the balls 

U+ := 3 d+ 1 U, 

U : = 3 n+ 1 U, (43) 

U+ : = 3 n+d+ 2 U. 


For technical reasons, that will soon become apparent, in order to deal with the ‘small 
gradient’ case we make the following assumption on the map f : U —» M n . 

Assumption 1. The map f = (/i,..., f n ) is an n-tuple of C l+1 functions defined 
on the closure of U + which is l-non-degenerate everywhere on the closure of U + . 


Remark. In view of the discussion of 1 15.11 there is no loss of generality in imposing 
Assumption 1 within the context of Theorem [3j 


We denote by so the measure of non-degeneracy of f on U + . Note that Assumption 1 
ensures that 

s 0 := s(l] f, U + ) > 0. (44) 

Also, notice that it ensures the existence of a constant M > 1 such that for all k <1 + 1 
and all Ui,..., u* G § d_1 , 


sup 

xeu+ 


dH 


<9ui ... dui. 


< M , 


(45) 


where chij means differentiation in direction Uj. Note that the left-hand side of ()42l) is 
the length of the projection of d k f(x)/du k on the line passing through v and hence it 
is no bigger than M. This implies that 


Sq < M. 


Without loss of generality, we will assume that the radius r of the ball U satisfies 

s 0 -a(l,d) r]s 0 


r < min 


2 . 3 n+d+ 2 ^ M ’ 4 .10 7 3 n+d+2 dMl l+2 (l + !)!/’ 


(46) 


where 


// := mm < —, 


V A 


16’ \2 d + 2 dl(l + iy/ l 5 d 




(47) 
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and where 


<9 •= 23^ ■ 4 ’ ( ■ (2') 2+,( '- 1)/2 ■ (f + idT’. 2, i) ’ 


with the quantity B , k ) defined as 

k(k-l)/2 

4>(oj,B,k) := —-=- 

2V2- • (fc + 1)! 

for any integer P > 1 and any real numbers u, B > 0. 

Furthermore, define the following constants determined by f and U: 

Pi := -—- 7 = (2 r ) 1 , 

4l l (l + l)\Vd 


Al l (l + l)V 


P2 ■ = 


SO ( T 


4I l (Z + l)! \M 


so 

41'(1+1) 


(r(l-l/^)) a 

i te)) + ( r ( x + 75)) 


Finally, let 


'p\ + (pa + 2 M 2 ) 5 


Theorem 5. Let U C 6 e a ball and f = (/ 1 ,..., / n ) fre an n-tuple of C l+1 functions 
satisfying Assumption 1. Then, for any 0 < S' < 1, any n-tuple T = (Ti, ... ,T n ) of 
real numbers > 1 and any K > 0 satisfying 

8'KTi . T n 

- 1 - 53 

maxi=i Ti 

define the set A(S',K,T) to be 

( ( ll f ( x )-q|l<^ 1 

A(8', K , T) := \ x G U : 3 q G Z n \ {0} such that { || Vf(x)q|| 0O < K > . (54) 


Then 


where 


| A(5\ K, T)| d <E[ y/n + d + 1 • £1 


\qi\ <Ti,i = l,...,n 


l/d(2l—l) 


,, / 8'KTi . T n 

e 1 := max d , -—- 

\ max 1 i 

, \ Ki<n 


E := C(n + l)(3 d 7V d ) n+ 1 p- 1/d(2,_1) , 
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in which p is given by (1521) and C is the constant explicitly given by (ITTj) below. 

At first glance the statement of Theorem [5] looks very similar to [ 8 f Theorem 1.4]. 
We stress that the key difference is that in our statement the constants are made fully 
explicit. The proof of Theorem 0 is rather involved and will be the subject of §3 


6 . A STRENGTHENING AND PROOF OF THEOREM [3] 


In view of the discussion of 1 15.11 Theorem [3] will follow immediately on establishing a 
stronger result (Theorem [ 6 ] below), which explicitly characterizes the dependence on T 
and A4 of the constants k 0 , C 0 and C\ appearing within the statement of Theorem [3] 
In the case that the function f defining the manifold under consideration is explicitly 
given, the values of these constants may be improved by following the methodology of 
the proof of Theorem [b] as many computations will then be made simpler. 

Let 

n 

Cy := sup T(q)JTg J + , where qf max{l, |gj|}. (57) 

q=(<jl,...,g„)gZ™\{0} i=l 

It is a well known fact that, under the assumption that T is monotonically decreasing 
in each variable, relation implies that 0 < C ^ < oo. Also define the constant 

_ II hIoo _ 

2 2d(2i —l)(n+l) ; 

teZ" 

which is clearly finite and positive as the sum converges. 

Theorem 6 . Let U C be a ball whose radius satisfies (146|) and let f = (/j,..., f n ) 
be an n-tuple ofC l+1 functions satisfying Assumption 1. Let L, Kd and E be given 
by (ESD , (1551) . (ITT]) and (llifiD respectively and let 

K 0 = E (Vn + d+ 1 • 2 n - 1/2 V^Zh)^ 


^ l/d( 2 Z— 1 ) 


Given any 5 > 0, let 



k := min <| 

1 5 i 

/ ^ \ d(n+l)(2l— 1) 

Cy2 n ~ 1 / 2 VndL 2A7AV' 

\2K 0 S n ) 

Then 

|B n (tt,«,A4)| d >(l- 

WU- 


(58) 


Clearly the above is an explicit version of Theorem [3] in the case when // is Lebesgue 
measure. The arguments given in the proof of Theorem [2] are easily adapted to deal 
with the general situation. 
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6.1. Proof of Theorem [6] modulo Theorem [5j For k > 0 and any q G Z n , define 


A(k] q) := {x G U : ||f(x) ■ q|| < Rd'(q) & (jiOjl holds} 

and 

A c (k; q) := {x G U : ||f(x) • q|| < R'h(q) & (l40l) does not hold} . 
Clearly it suffices to prove that 


U ^(«;q) 

q€Z«\{0} 


<||U! 


and 


U dl c (K;q) 

qSZ™\{0} 


<?|UL. 


(59) 


By Theorem [4] with 5' = Ac\l/(q), we immediately have that |v4(k; q )U < AdRT(q)|U| d . 
Then, summing over all q G Z n \ {0} gives 


U 

qSZ™\{0} 


< K d T,^K\\J\d < - \U\ d . 
d 


(60) 


Now to establish the second inequality in (T59|) . given an n-tuple t = (t 1; ... ,t n ) G Nq, 
define the set 

A l~ U A c (k: q), (61) 

q=(gi,..., 5 „)eZ n \{ 0 } 

2 t i<qf<2 t i + 1 


where qf = max{l, |gj|}. Observe that 


U <») = U A i ■ < 62 > 

qez™ teNg 


By (1571) an d the monotonicity of T in each variable, for every q = (q 1} ..., q n ) G Z n \{0} 
satisfying the inequalities 2 U < qf < 2 ti+1 , we have that 


and 


Now let 


*(q) < c* n a + < n 2_ti =^2" 




i=l 


IQ11 oo ^ 2 


maxi ti+1 


5' = hiCy 2 K = VndL2 max< ti+1 and T t = 2 ti+1 (1 < i < n). 

Then, A £ is easily seen to be contained in the set A(5', K, T) defined within Theorem [5] 
Clearly Tf,... ,T n > 1 and K > 0. Since k < C^ 1 , we have that 0 < S' < 1. Finally, 
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(T53|) is satisfied, since 

S'KTi . T n _ 2“+ 1 

maxj = i r .. >n Ti 2 max i t i+ 1 


KC*2 n VndL nC^-^VndL 


< 1 , 


( 63 ) 


2(maxi ti+l)/2 211*1100/2 

where the last inequality follows from the definition of k. Therefore, Theorem [5] is 
applicable and it follows that 

l/rf(2Z—1) 


\A c t \ d < | A(S’, K. T)\d < E (Vn + d+ 1 • £ 1 ) 


Md, 


where E is given by (1561) and where, from (l63l) . the dehnition of S' and the fact that 

HvEty 1 , 


E\ = max 


nCq,2 


nCq, 2 n 1 yndL 


1 \ 


r 

l-( 


2ll t ll°o/2 

Then, using (162|) and summing over all t £ Nq, we find that 


kCx q>2 n 1 VndL \ 
211 * 1100/2 1 


n+1 


U A c (k; q) 

qeZ" 


£ E iVn + d + 1- 1 

nCq, 2 n 1 \/ndL 

2ll*lloo/2 

teNg V \ 

KoSnK^+W-^lXJld 

< ||UU, 


1 \ l/d(2Z—1) 

n+l 


|U| 


where the latter inequality follows from the dehnition of k. This establishes the secound 
inequality in and thus completes the proof of Theorem [ 6 ] modulo Theorem 0 


7. Proof of Theorem [5] 

To establish Theorem 0 we will follow the basic strategy set out in the proof of [Sj, 
Theorem 1.4]. We stress that non-trivial modifications and additions are required to 
make the constants explicit. To begin with, we state a simplified form of [ 8 ] Theo¬ 
rem 6 . 2 ] and, to this end, various notions are now introduced. 

Given a finite dimensional real vector space W, v will denote a submultiplicative 
function on the exterior algebra f\ W ; that is, v is a continuous function from f\ W to 
M+ such that 

u(tw) = |f| i/(w) and z/(u A w) < z/(u)z/(w) (64) 

for any i 6 K and u, w e /\ W. Given a discrete subgroup A of W of rank k > 1 , 
let i/(A) := z/(v 1 A • • • A v fe ), where v 1; ..., v*. is a basis of A (this dehnition makes 
sense from the hrst equation in (I6T1P . Also, £(A) will denote the set of all non-zero 
primitive subgroups of A. Furthermore, given C,a > 0 and V C M d , a function 
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/ : x G V (->• /(x) G 1 is said to be (C, a)- good on V if for any open ball B C V and 
any e > 0 , 

|{x G B : |/(x)| < esup|/(x)|}| d < Ce a \B\ d . 

xS-B 

Theorem 7. (|El Theorem 6.2]J Let W be a d + n + 1 dimensional real vector space 
and A be a discrete subgroup ofW of rank k. Let a ball B = 5(x 0 , r) C R d and a map 
H : B —> GL [W) be given, where B := 3 k B. Take C, ct > 0, 0 < p < 1 and let u be a 
submultiplicative function on f\W. Assume that for any T G £(A), 

(i) the function x —> u(H{pc)T) is (C, a)-good on B, 

(ii) there exists x e B such that u(Hfx) T) > p, and 

(iii) for all xeB, ^{T G £(A) | u(H(x.)T) < p} < 

Then, for any positive e < p, one has 

|{x G B : 3 v G A \ {0} such that v(H(x)v) < e\\ d 

Theorem [5] is now deduced from Theorem [7] in the 
to the parameters appearing in Theorem [TJ. we let 

W = R ra+d+1 

and 

z/* be the submultiplicative function introduced in [51 §7]. 

There is nothing to gain in formally recalling the definition of z/*. All we need to know 
is that 14 as given in [ 8 ] has the property that 

14 (w) < ||w || 2 V w G /\ W ( 66 ) 

and that its restriction to W coincides with the Euclidean norm. Next, the discrete 
subgroup A appearing in Theorem [7] is defined as 

^0 j G R n+d+1 : P GZ,qGZ"J. (67) 

Note that it has rank k = n + 1, therefore the ball B appearing in the statement of 
Theorem [7] coincides with the ball U defined by Finally, we let the map H send 
x G U to the product of matrices 

#(x) := DU X1 (68) 

where 

(l 0 f(x) \ 

£4 := 0 I d Vf(x) 

\0 o I n J 



oo. 


< k(3 d N d ) k C ( - ) 15),. (65) 


following manner. With respect 


^ SL n+ , + 1 (M) 


(69) 
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and D is the diagonal matrix 


D := 


diag 


f El El El El £i\ 

U'’ AT’ K’ Ti'"'T n ) 

d times 


( 70 ) 


defined via the constants S', K, T),..., T n , and eq appearing in Theorem [5] 

With the above choice of parameters, on using (| 66 |) . it is easily verified that the set 
A(8',K,T) defined by (T54|) within the context of Theorem [5] is contained in the set 
on the left-hand side of (j65j) with e := £\\fn + d + 1. The upshot is that Theorem [5] 
follows from Theorem [7] on verifying conditions (i), (ii) and (iii) therein with appropriate 
constants C, a and p. With this in mind, we note that condition (iii) is already 
established in [ 8 ) §7] for any p < 1 . In )JH] below, we will verify the remaining conditions 
(i) & (ii) with the following explicit constants : 


where 


C := 


C{ := max 


d(n + 2) (d(n + 1) + 2) \ a ^ 2 


max (Ci, 2C d} i ), 


2 M 2 d+2 , r/r . M f2l l + l' 1/l ' 

■ dl{l + 1 )- 


s 0 • er(Z, d) ’ V d 
(here, a(l,d) is the quantity defined in (J48]) ), 

2 d+1 dl{l + l) 1 ^ 


s 0 \cr(l,d) 


C d ,i : = 


V d 

1 


a : = 


d(2l - 1 ) 

and p = p as defined by (152|) (note that p < 1). This will establish Theorem [5] 


(71) 

(72) 


(73) 

(74) 


8. Verifying conditions (i) & (n) of Theorem [7] 

Unless stated otherwise, throughout this section, A will be the discrete subgroup 
given by (1571) and T G £(A) will be a primitive subgroup of A. Verifying condition (i) 
of Theorem [7] is based on two separate cases : one when the rank of T is one and the 
other case of rank > 2. 

8.1. Rank one case of condition (i) . The key to verifying condition (i) in the case 
that T is of rank one is the following explicit version of [8j Proposition 3.4], Notice that 
it and its corollary are themselves independent of rank and indeed T. 

Proposition 1. Let U C W 1 be a ball, T C C l+1 be a family of real valued 

functions and A and 7 be positive real numbers such that : 

(1) the set {V/ : / G T} is compact in C l ~ l 
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( 2 ) sup sup |^/(x)| < A for any multi-index (3 G Ng with 1 < |/3| < l + 1, 
/e-E xgu+ 


(3) inf sup sup 

■t £jr uGS d—1 1 <k<l 


gkf 

du k 


( x o) 


> 7 , where x 0 is the centre of U, 


(4) — ^ +( | + 2 ^/^^ — r > where r is the radius of U as defined in (j46l) and cr(l,d) is 
defined in 


Then, for any f € T , we have that 

(a) / is (Ci, ji)-good on U, 

(b) HV/IL is (Ci, 5 (pYy)- 50 od on U, 


where 


Ci = Ci ( 7 , A) := max 


2A 2 d + 2 .A /2Cfl\ 1/P 

7 • Vd + 7 U(^^)y 


(75) 


Remark. Hypothesis (2) is additional to those made in [HI Proposition 3.4]. In short, 
it is this “extra” hypothesis that yields an explicit formula for the constant Ci- Note 
that by the definition of Cf as given by (1721) . we have that 

C* = Ci(s 0 , M). 

Using the explicit constant Ci appearing in Proposition [U it is possible to adapt the 
proof of [5, Corollary 3.5] to give the following statement. 

Corollary 3. Let U C R d be a ball and t — (fi,, f n ) be an n-tuple of C l+1 functions 
satisfying Assumption 1. With reference to Proposition |7J let 

7 := s 0 and A := M. 

Then, for any linear combination f = Co + Y^=\ c ifi w ^h cq, ..., c n G M, we have that 

(a) / is (Cl, 4.) -good on U, 

(b) ||V/Hoo is (cr, -^j-^j-good on U. 

Corollary [3] allows us to verify condition (i) of Theorem |T| in the case that P is 
a primitive subgroup of A of rank 1. Indeed, in view of (168|) and of the discussions 
following equations (164|) and (l 66 lh i/*(H(x.) T) is the Euclidean norm of Hfx .)w = DU x w, 
where w is a basis vector of T. It is readily seen that the coordinate functions of 
H (x)w are either constants, or /(x), or df(x)/dxi for some f = cq + Y^=i c ifi with 
c 0 ,...,c n G M. Hence, by Corollary |H] and [ 8 ’ Lemma 3.1 (b,d)] we obtain that the 
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function ||-ff’(-)r|| 00 is {Cf, a)-good on U, where a is given by (j74lh In turn, on using [8] 
Lemma 3.1(c)] and the fact that 

1 ^ ll#( X Hloo -| 

y/n + d+l ~ ||iL(x)w|| 2 “ 

we have that z/* (H( . )T) is (Cf{n + d + l)"/ 2 , a) —good on U. It then follows from [8[ 
Lemma 3.1(d)] that 

z/* ( H{. )T) is ( C , a) — good on U. 

Proof of Corollary^ In view of [8j Lemma 3.1.a], it suffices to prove the corollary 
under the assumption that ||(ci,... ,c n ) || 2 = 1. Thus, with reference to Proposition [1] 
define 

J~ . \ Co T ^ ^ Cj/i • || (Cl, • • • , C n ) 112 1 

l i=1 

The corollary will follow on verifying the four hypotheses of Proposition [0 Thus, 
hypothesis (1) is easily seen to be satisfied. Hypothesis (2) is a consequence of (ITiD and 
of the Cauchy-Schwarz inequality while hypothesis (3) follows straightforwardly from 
the definition of the measure of non-degeneracy So in (1421) and (ITT!) . Finally, hypothesis 
(4) is guaranteed by (1461) and the choices of 7 and A. □ 



8.2. Proof of Proposition [H The proof of Proposition |T] relies on the following 
lemma : 


Lemma 3. Let f be a real-valued function of class C k (k > 1) defined in a neigh¬ 
bourhood o/x G (d > 1). Assume that there exists an index 1 < i 0 < d and a real 
number ji > 0 such that 


oyi 

dx k 0 



> h- 


Then there exists a rotation S : —)■ M. d such that 


d k (foS) 
dx k 



> /i • a(k, d) 


for all indices 1 < i < d, where the quantity a{k,d) is defined m fl48l) . 


As the proof of Lemma [3] is lengthy, before given it, we show how to deduce Propo¬ 
sition [1] from it. 

Deduction of Proposition Q] from Lemma\f^ Let x 0 = (17,^2 ,... ,i’d) denote the centre 
of U. Hypothesis (3) of Proposition [Tj implies that for any / 6 J, there exists a unit 
vector u e S d_1 and an index 1 < k < l such that 


oyi 

du k 


( x o) 


> 7- 














DIOPHANTINE APPROXIMATION AND INTERFERENCE ALIGNMENT 


27 


Even if it means applying a first rotation to the coordinate system that brings the 
X\ axis onto the line spanned by the vector u, it may be assumed, without loss of 
generality, that the above inequality reads as 

|<9i/(x 0 )| > 7 . 

From Lemma |3l up to another rotation of the coordinate system, one can guarantee 
that 

|<9f/(x 0 )| > 7 ’<r(k,d) := C 2 

for all indices 1 < i < d. Now, for a fixed index i, it follows from a Taylor expansion 
at x 0 that, for any x = (aq,..., Xd) G U + , 


dif (x) = d\f (x 0 ) + ^(x; x 0 ) (xj - Vj ), 

3 = 1 

where, by hypothesis ( 2 ), i?,j(x; x 0 ) satishes the inequality 

|-Ri(x;x 0 )| < sup | (dj o <9f) /(x)| < A. 

xetJ+ 

In view of hypothesis (4), we have furthermore that 


||x — x 0 || 2 < 3 

Thus, for all indices 1 < i < d, 


l+d+2r < 7 ' <r{l, d) < _C^ 


2Vd\ 2Vd\ 


(x)| > I Xj - Uj\ X = C 2 - X ||x - x 0 1| 1 

i=l 

> C 2 — XVd ||x — x 0 1| 2 > 


C 2 


(76) 


Next, observe that any cube circumscribed about U lies inside of U + . It then follows 
on applying [ 8 l Lemma 3.3] with A\ = A and A 2 = C 2 /2 that the function / is (C", ^)- 
good on U, where 

r\dL / n\ \ 

C' := v dk(k + i) (—^(k + !) (2k- + 1)) 


od +2 \ 

< ——dk(k + 1)- 
V d 7 

A computation then shows that 


/2 k k + l\ 1/k 

V <r(k,d) J 


C' < 


2^+2 

~vT 


dl(l + 1 ) 


A 

7 


(2 l l + 1 

Va(/,d) 


1 n 


Part (a) of Proposition |T| is now a consequence of [SJ Lemma 3.1.d]. Regarding part 
(b), the proof is essentially the same as that of [ 8 j Proposition 3.4.b] with the constant 
C replaced with the explicit constant C\ given by (T73]i . 
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□ 

We now proceed with the proof of Lemma [3] which requires several intermediate 
results. The first one is rather intuitive. 

Lemma 4. Let C > 0 be a real number and p > 1 be an integer. Then every section 
of the cube [0, C] p with a (p — 1)-dimensional subspace of W has a volume at most 
V2CP- 1 . 


Proof. See jU Theorem 4], □ 

Lemma 5. Let k > 1 denote an integer and let w := (w 0 ,..., Wk) G M fc+1 . Letu , B > 0 
be real numbers. Furthermore, assume that the k + 1 real numbers 0 < to < • • • < £& 
satisfy the following two assumptions : 

(!) min \ f i ~ t i\ > W 
(2) max \tA k < B. 

0 <i<k 


Then, there exist an index 0 < j < k such that 

k 

i =0 

where (j)(cu,B; k ) is the quantity defined in (l49j) . 


> 0((n, B;k) ■ ||w 


The following notation will be used in the course of the proof of Lemma [5] : given a 
point x 6 l" and a set Act", dist(x, A) will denote the quantity 

dist(x, A) \= inf{||x — a|| 2 : a <G A}. (77) 


Proof. Let X := (xi, • • • , x*; + i) G denote the matrix defined by the following 

k + 1 column vectors in M fc+1 : 


X! := 


Xfc+i := (1 ,t k ,...,t k k ) T . 


Together with the origin, these points form a simplex «S(X) in R fe+1 whose volume 
|«S(X )| fc+1 satisfies the well-known equation 


1 

(k + 1)! 


det 



Xfc+i 0 \ 

1 f J' 


l 5 (X )| fc+1 
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The formula for the determinant of a Vandermonde matrix together with hypothesis 
(1) then yields the inequality 


l 5 ( X )lfc +1 


w fc(fe-l)/2 

- (k + 1)! ' 


Note that hypothesis (2) implies that all the vectors xi, • • • , x/- + i lie in the hypercube 
B := [0,i?] fc+1 . As a consequence, the volume of the section of the simplex «S(X) with 
any hyperplane does not exceed the volume of the section of «S(X) with B which, from 
Lemma IH is at most \/2B k . Also, given a hyperplane V. it should be clear that 


l$(X )| fc+1 < 


max dist 

i<i<fc+i 


h v) ■ |pns(X)|, 


The upshot of this discussion is that the following inequality holds : 

u k(k-i)/2 

max dist (xj.V) > — -= - := 6(uj,B,k). 

i <i<k V 31 J ~ 2V2B k (k + l)\ V ' 


(78) 


Consider now the hyperplane V = and let j be one of the indices realizing the 
maximum in (1751) . The conclusion of the lemma is then a direct consequence of the 
equation 

dist (x 7 -, w^) 

ll w ll 2 


El 


o w itj 


□ 


The next result contains the main substance of the proof of Lemma [3] 

Lemma 6. Let f be a real valued function of class C k (k > 1) defined in a neighbour¬ 
hood of (aio, ?/o) G ® 2 - Let c > 0 be a real number such that 


Qkf 


dx k 


(xo,yo) 


> c. 


Then, there exist two orthonormal vectors u, v e S 1 such that 


nun 


d k f 


du k 


(x 0 ,yo) 


d k f 


dv k 


(zo,yo) 


> 


23fc/2 

= c • a(k , 2). 


• (f ( [V2{2kf +k{k - l)/2 (k + l)l) 1 ,2, k 


Proof. Set 

w = (»o, ■■■.--.)-( g) e R ‘ +1 - 

It readily follows from the assumptions of the lemma that 
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Let A > 0 be a real number such that, for all indices 0 < j < k, 

d k f 


dx k ~idyi 


j(xo,yo) 


< A. 


We thus have the inequality 


|w|| 2 < 2 k (k + 1)A. 


( 79 ) 


Define now k + 1 real numbers to,... ,tk as follows : 

1 i 

U '- = 2 + W 

where i — 0 ,..., k. 

With the choices of the parameters uj := 1/(2 k) and B = 1, Lemma [5] applied to the 
vector w and to the system of points (ti) 0 <i<k yields the existence of a point tj £ [1/2,1] 
such that 

k 




i=0 


> ||w|| 2 - 0 ( —, 1, k 


Let 


denote a constant and 


-Tk 


(80) 


(81) 


g : t £ [0,1] i->- Wit 1 
a function. Note that for all t £ [0,1], 


i=0 


!</(*)! = 


Wilt 1 1 


i=1 


< 2 k X ■ k ^ k + = 2 k ~ 1 Xk{k + 1). 


This implies that for all t £ [tj — e, tj + e], where tj is the constant appearing in 
the following inequalities hold : 


\g(t)\ = 


y ^Wjtf 


i=0 


> I - \g(t)-g(tj)\ 


> \g{tj)\ — e ■ 2 k 1 Xk(k + 1) 


> 

(fsotfcifsn 


4> ^ ■ (ll w ll 2 - 2 fc 2 K k + !)) 
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Consider now the image [a, b] C [1, 2] of the interval [tj — e, tj + e] D [1/2,1] under the 
map t K > 1/t. It is then readily verified that 

\b — a\ > e. 

With the choices of the parameters 

“ := rrd 5 ,u ) 

and B — 2, apply once more Lemma EH this time to the vector ((—l)*Wi) 0 <j< fc an d to 
the set of points 

b — a 

ti = a -(--— • z, 0 < i < k. 

k 

This yields the existence of tj £ [a, b] such that 

k 

£M)‘ ro d 

i=0 



The upshot of this is that, when considering the point s := 1 /tj, the following two 
inequalities hold simultaneously : 


and 



i =o 



E(-i)^ s_ 


t =0 


> II w I 




Since s £ [1/2,1], it is easily seen that one can find a unit vector (wi,w 2 ) £ S 1 such 
that s = 112 /ui and ui,U 2 £ [l/(2y / 2), 1]. Let u £ S 1 and v £ S 1 denote the two 
orthonormal vectors defined as u := (-Ui,w 2 ) and v := (w 2 , —Wi). 


Note then that 

^ 2h3*tH w I 

- 2 1 +“/ 2 

and, similarly, 


E 

i= 0 


Wiu\ l u l 2 




E 

i=0 


WiS 


k 

i=0 


«2 


k 

Ei-pVis-* 

1=0 


> 


c 

23^/2 
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Since, from the definition of the vector w, 


and 


X/ 

du k 

X/ 

&v k 


(x 0 ,yo) = ^w i u k l 


k ~ i ui 


i =0 


(xo,Vo) = ^2(-l) l WiU k X, 


i =0 


this completes the proof of the lemma from the definition of (j) in (l49lh 


□ 


We now have all the ingredients at our disposal to prove Lemma EJ 

Proof of Lemma ^ 1 Denote the coordinates of the vector x e as x = (x ±,... ,Xd)- 
Even if it means relabeling the axes, assume furthermore without loss of generality that 
i 0 = 1 in the statement of the lemma. The proof then goes by induction on d > 1, 
the conclusion being trivial when d — 1. When d = 2, Lemma [3] reduces to Lemma H 
Assume therefore that d > 3. ft then readily follows from the induction hypothesis 
applied to the function (xi,..., Xd- i) G M d_1 f(x i,..., Xd-i,Xd) that there exists a 
rotation Si : such that 


d k (foS 1 ) 

dx k 



> /x ■ a(k, d — 1 ). 


Consider now the function (x\,Xd) G M 2 t-)- f(xi, ..., Xd-i, Xd)- Applying Lemma i 
to this function with c = n ■ cr(k, d — 1 ) therein provides the existence of a rotation 
S 2 : n- acting on the plane (xi,Xd) and leaving its orthogonal unchanged such 

that 


min 


d (/ o Si o S 2 ) 

dx k 


d(f ° S 1 o S 2 ) 

dx k 



> jx ■ a(k, d — 1 ) • cr(k, 2 ) 


H ■ cr(k, d). 


The lemma follows upon setting S = Si o S 2 . 


□ 


8.3. Higher rank case of condition (i). The key to verifying condition (i) of Theo¬ 
rem |T] in the case when T is of rank greater than one is Proposition [2] below. In short, 
it is an explicit version of [ 8 ] Proposition 4.1] in the particular case when the set Q 
appearing therein is given by 

Q := {(ui • f, u 2 • f + u 0 ) : u 0 G M, U!, u 2 G M”, u x ± u 2 } . (82) 

The statement is concerned with the skew gradient of a map as defined in [ 8 ] §4], We 
recall the definition. Let g = (r/i, r/ 2 ) : U + —> M 2 be a differentiable function. The skew 
gradient Vg : U + —> M 2 is defined by 

Vg(x) := <7 i(x)V(7 2 (x) - ^ 2 (x)V^i(x). 
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If we write g(x) in terms of polar coordinates; i.e. via the usual functions p(x) and 
0(x), it is then readily verified that 

Vg(x) = p 2 (x)V 6 »(x). (83) 

Essentially, the skew gradient measures how different the pair of functions g± and g 2 
are from being proportional to each other. 

Proposition 2. Let U C M. d be a ball and f = (fi, ■ ■ ■, f n ) be an n-tuple of C l+1 
functions satisfying Assumption 1. Let p 2 , C d ,i and Q be given by (I5TJ) . (l73jl and (j82j) 
respectively. Then, 


(a) for all g G Q, 


l|Vg || 2 is 



d(2l - 1 ) 


— good on U 


(b) for all g G Q, 

sup ||Vg(x )|| 2 > p 2 . 
xe u 


(84) 


This proposition together with Corollary [3] and the basic properties of (C, cx)-good 
functions given in [ 8 ( Lemma 3.1] enables us to deduce the following statement, which 
establishes condition (i) in the higher rank case. 

Corollary 4. Let U C be a ball and i — (fi,, f n ) be an n-tuple of C l+1 functions 
satisfying Assumption 1. Let A be the discrete subgroup given by (167|) and T G £(A) 
be a primitive subgroup of A. Furthermore, let H be the map given by (1681) . Then, the 
function 

x i y v* (iL(x)T) (85) 

is (C, a)-good on the ball U with constants C and a given by (TTTT) and fTbl|) respectively. 

Proof. Let k denote the rank of T. The case k = 1 has already been established 
as a consequence of Corollary [3] in 1 18.11 Assume therefore that k > 2. It is shown 
hi [ 8 ] §7 Eq(7.3)] that there exist real numbers a,b,p G M such that, for all x G 
U, z/* (if(x)T) given by (1851) can be expressed as the Euclidean norm of a vector 
w(x). Furthermore, there exists an orthonormal system of vectors of the form S = 
{e 0 ,e*,... ,eS, vi,..., v fe _i} when k < n or of the form S = {e 0 ,e*,..., e^, v 0 , ..., v fc _i} 
when k = n + 1 such that w(x) is a linear combination of 

(k + 1 ){dk + 2) 


L d (k) : 


2 
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skew products of elements of S whose coefficients are of any of the following form : 



a + bi ■ v 0 


( 86 ) 


b 


(87) 


bi -Vi 

(1 < i < k - 1) 

( 88 ) 


b/J,d s (i ■ v^ 

(1 < i < k, 1 < s < d) 

(89) 


s) 

(1 < i < k — 1, 1 < s < d) 

(90) 


bnY(i,j,s) 

(1 < i < j < k — 1, 1 < s < d), 

(91) 

where 

X(i,s) := (f • v^ 

1 d s (a + bi ■ v 0 ) - (a + bf ■ v 0 ) d s (f • v*) 


and 





s ) := (f • d s (f • Vj) - (f • Vj) d s (f • v*). 


• It follows from part (a) of Corollary [3] and [ 8 j Lemma 3.1(a,d)] that the coordi¬ 
nate functions given by (1861) . (187)) and (j 88 j) are (C", a)-good, where 

C' := C 

{L d {n + 1) • d ) a/2 

• It follows from part (b) of Corollary [3] and [ 8 j Lemma 3.1(a,d)] that, when the 
index i is fixed, the maximum over s of the coordinate functions given by (189)) . 
that is, the quantity || 6 /x V(f • Vj)||oo, is (C',a)~good. 

• It follows from Proposition [2] and [ 8 j Lemma 3.1(a,d)] that, for fixed indices 
i and j, the Euclidean norm over s of the coordinate functions given by (1901) 
and (ED), that is, the quantities ||/i V(f • v*, a + bi ■ v 0 )|| 2 and \\b /i V(f ■ v, : , f • Vj )|| 2 
respectively, are (C", a)-good. On using the relation 

1 < Hjp<i 

valid in and [SJ Lemma 3.1(c)], it follows that ||/i V(f • v,, a + bi ■ v 0 )||oo and 
||6/iV(f • Vj,f • v^lloo are (d a/2 C\a)- good. 

The upshot of the above together with P, Lemma 3.1(b)] is that the maximum of 
the coordinate functions (l86]Ml9Tf is (d a ^ 2 C, a)-good. In turn, on using the relation 

1 <i!_!k<i 

V L d (k) || • lh 

valid in and P Lemma 3.1(c)], we have that 

*4 (H( . )r) is (C\d ■ L d (k )) a / 2 , a) - good. 
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As k < n + 1 , the desired statement follows. 

□ 

Modulo the proof of Proposition [21 we have completed the task of verifying condition 
(i) of Theorem [7] The proof of the proposition is rather lengthy and therefore is 
postponed till after we have verified condition (ii) of Theorem [71 


8.4. Verifying condition (ii) of Theorem [7] modulo Proposition [2j The following 

lemma, which although not explicitly stated, is essentially proved in [HI §7], see [HI 
Eq(7.5)] and onwards. The key difference is that we make use of Proposition [2] in place 
of [ 8 ^ Proposition 4.1] and so are able to give explicit values of p\ and p. 

Lemma 7. Let U C be a ball and f = (/ 1; ..., f n ) be an n-tuple of C l+l functions 
satisfying Assumption 1. Let p \, p > 0 be given by (l50|) and (152|) respectively and 
assume that for any v G S n_1 and p G M we have that 

sup |f(x)-v +p\ > pi and sup ||V (f(x)-v)|| oo > (92) 

xeu xeu 

Furthermore, let A be the discrete subgroup given by (1H7|) . T G £(A) be a primitive 
subgroup of A and H be the map given by (l68l) . Then 

sup z/*(i7(x)r) > p. 

xeu 


The following statement immediately verifies condition (ii) of Theorem [71 It is the 
above lemma without the assumptions made in f[92jl . 

Corollary 5. Let U C R d be a ball and f = (fi,, f n ) be an n-tuple of C l+1 functions 
satisfying Assumption 1. Let A be the discrete subgroup given by (1671) and V G £(A) be 
a primitive subgroup of A. Furthermore, let F[ be the map given by (j68l) . Then 

sup z/*(if(x)r) > p, (93) 

xeu 


where p is given by (j52j) . 


Proof of Corollary O The desired statement follows directly from Lemma 0 on verify¬ 
ing the inequalities associated with (1921) . Let v G By the definition of So := 

s(l] f, U + ), there exists a u G and 1 < k < l such that 


d\i ■ v) 
(h\ k 


( x o) 




(94) 
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Recall, that x 0 is the centre of U. It follows that for any x G U, we have that : 


d k i 


dH 


(dH dH \ 

V du k X 

— 

v 'au‘ (xo) 


V '(au‘ (Xo) du^ x ) 


> So 

Let s' denote the unit vector 


dH , d k f 

(xq) - ~ r l X J 


<9u fc du k 

x-x 0 


( 95 ) 


s := 


ll x - x o || 2 

By Lagrange’s Theorem, there exists x' between xq and x such that 


d k i . d k f 

;(X 0 ) = ~ r l X ) 


d (d k i 


<9u fcV u/ du k y ' ' 1 0 12 (9s' \du k 

It then follows from ([95]) and the definition of M in (051) that 


v- 


d k f 
du k 1 


> sq — M ||x — x 0 | 


This together with the fact that r < Sq/2M — a direct consequence of 
that 

d k (f ■ v) 


du k 


> s 0 /2 V x G U . 


implies 

(96) 


The upshot is that the hypotheses of [8] Lemma 3.6] are satisfied. A straightforward 
application of [SJ Lemma 3.6] together with (150]) implies that 


sug|f(x) -v+p| > 2fc ^° / ^ 1)! 


( 2 r) k > 


so 


r(2 r) 1 = piVd > p 1 


4l l (l + 1)! 

for any p G R. Thus the first inequality appearing in (1921) is established. 

It remains to prove the second inequality in (192]) : that is, that for any v G 

sup ||V (f(x)-v)|U > PI - m /° 1)lv/5 ( 2r )'. 


(97) 


Recall from above that for any v G S n 1 we can hnd a vector u G 1 such that (J94|) 
and (1961) hold. Furthermore, observe that 


c>(f ■ v) 

du 


(x) = u 4 • V (f (x) • v). 


( 98 ) 


We proceed by considering two cases, depending on whether or not k — 1 in 
• Suppose k — 1 in (1941) . Then it follows from (196|) and (198|) that 


s o 


u*- V (f (x) ■ v) > — V x G U . 
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On applying the Cauchy-Schwartz inequality, we obtain that 
l|V(f(x) • v)|| 2 > y V x G U . 

This together with the fact that || . || 2 < \fd || . implies the second inequality 
appearing in (l92j) . 


• Suppose k > 2 in (l94|) . Consider the function g(x) 
Then by (1961) . we have that 


d k 1 g 
<9u fc_1 




Vx G U. 


9 (x) defined on U. 


Thus, the hypotheses of [8, Lemma 3.6] are satisfied for the function g(x) and 
a straightforward application of that lemma together with (1501) implies that 

sup |s(x)| > _' 8 i)f-i /! ( 2r ) t ~ 1 > px ^~ d ' (") 

Now the Cauchy-Schwartz inequality and (j98l) imply that 

IIV (f(x) ■ v)|| 2 > |u‘ • V (f(x) ■ v)| = | 9 (x)| V x 6 U . 

This together with (j99[) and the fact that || . || 2 < Vd\\ . imply the desired 
statement; namely that 

sup||V(f(x) •v)|| oo > Pl . 
xeu 


□ 


The upshot of £j8]is that we have verified conditions (i) & (ii) of Theorem |T] as desired 
modulo Proposition [2j 


9. Proof of Proposition [2] 


In order to prove Proposition [2l we first establish an explicit version of [8, Lemma 
4.3]. Throughout this section, the notation introduced in (1771) will be used. 

Lemma 8. Let B C be a ball of radius 1 and let denote the hypercube cir¬ 
cumscribed around B with edges parallel to the coordinate axes. Assume further that 
P — (PuP'i) : B t-G M 2 is a polynomial map of degree at most l > 1 such that 

| p ( x )- p ( y )|| 2 < 2 ( 100 ) 


sup 

X,y eBo 


and 


sup dist (£, p(x)) > - 

xeB 8 

for any straight line C C M 2 . Then, 


( 101 ) 


su p|l Vp(x) || -2 > 


xeB 


- - - f 

86 016 VW V 


1 + sup ||p(x) 

x6B 


( 102 ) 
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and 

sup ||Vpi(x)|| 2 < 2 l 2 \fd. (103) 

xgS, 2=1,2 


Proof. Regarding (I102[) . if we assume that sup xgB ||p(x)|| 2 > 6, the argument used 
to prove [SJ Lemma 4.3] gives the stronger inequality in which the constant factor 
1/(86 016-\/T0) is replaced by 1/64. Thus, without loss of generality, assume that 


SUP„ efl IIPMIIj <6. 


(104) 


It is easily inferred from (1 101 1) that there exists X] G R, the closure of B , such that 
||p (xi)|| 2 > 1/8. Working in polar coordinates and choosing the straight line C\ joining 
the origin to p (xi) to be the polar axis, let (p(x), 6>(x)) denote the polar coordinates of 
a vector x G M 2 . Thus, p(p(xi)) >1/8. Furthermore, from (1101ft . there exists X2 G B 
such that dist (£i,p(x 2 )) > 1/8 and therefore, together with (11041) . we have that 


|#(p(x 2 ))| > |sin 6 (p (x 2 ))| 

> 1/8 = J_ 

“ 6 48 


dist (£i,p(x 2 )) 
P (p(x 2 )) 


(105) 


Now let A be the straight line joining p(xi) and p(x 2 ). Furthermore, let £ 2 denote 
the ^-coordinate axis, (x\,y\) the Cartesian coordinates of p(x 2 ) and (p(p(xi)),0) the 
Cartesian coordinates of p(xi). Then the Cartesian equation of A is 


A : yix - (xi - p(p(xi)))y - p(p(xi))?/i = 0. 

It follows from the choice of the points Xi and x 2 together with (HOOD . (11011) and (11041) 
that 


^ < \yi\ < 6, p(p(xi)) > i and |xi - p(p(xi))| < 2. 
Therefore, the distance from the origin O to A satisfies the inequality 

l(p(p(x 1 ))y 1 | ^ (1/8) 2 1 


dist (A, O) = 


> 


y \ + { x \ -p(p(xi)) j 


V6 2 + 2 2 128\/l0 


(106) 


Let J denote the straight line segment [xi,x 2 ] and let u be the unit vector 

x 2 - X! 

u := --—• 

||x 2 - xi || 2 

Restricting p to J, Lagrange’s Theorem guarantees the existence of y G (x 1; x 2 ) such 
that 

r)0 

0 (p( x 2)) = w-(y)|J|- 


It then follows via (|83l) . (11051) and (11061) that 
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||Vp(y)|| 2 > |u- Vp(y)| = p 2 (y) 


39 < \ 

ar (y) 


> dist (A, O) |g( P ( , X2))l = dist (A, O) |9(p<X2))l 


PI 


|X 2 - xH 


> 


Thus, 

sup||Vp(x )|| 2 > 


128\/To x 48 x 2 12 288\/l0 

1 7 


> 


( 


1 + sup ||p(x) 


xeB 12 288 \/l0 86 016^ 86 016\/IoV xg b 

This completes the proof of (11021) . We now turn out attention to (1 1031) . 

Let i G {1, 2}. It may be assumed without loss of generality that Pi (0,..., 0) = 0 
and that the ball B is centered at the origin. Then, for given x 2 ,..., x d in R, consider 
the polynomial in one variable p (x) := pi ( x , a; 2 ,..., Xd), which is of degree at most l. 
It follows from (HOOD that 

sup \p{x)\ < 2 . 

M<i 

Hence by Markov’s inequality for polynomials, we have that 

dpi 



dp ( , 


sup 

da: 

= sup 

*I<1 

hi|<i 


(x 1 ,x 2 , ...,x d ) 


This together with the fact that 


dxi 

2 < \fd ||. implies that 


< 21 


max < sup ||Vpi(x )|| 2 , sup ||Vp 2 (x )|| 2 > < 2 l 2 Vd 

[xgB xg B 


and therefore completes the proof of the lemma. 


□ 


We now have all the ingredients in place to prove Proposition [2] 

Proof of Proposition^ The proposition is an explicit version of | 8 J Proposition 4.1]. 
Within our setup in which Q is given by (]82l) . the starting point for the proof of 
part (a) of [ 8 ], Proposition 4.1] corresponds to the existence of positive constants S, c, 
and a with 


0 < 5 < 1/8 and 2C d}l N d S 1/W2l - im ~ 2)) < 1 (107) 


such that for every g G Q one has 
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and 


VvGS 1 duGS ^" 1 3 A; < / : inf 

xGU 


V ■ 


d k g 

du k 



> c 


Sea 

m l+2 (i + i)\Vd 


(108) 

(109) 


for all multi-indices [3 with \/3\ = l. Here, £ = 2 l 2 \fd is the quantity in right-hand side 
of (I103p and the real number a is required to be less than the constant appearing in 
the right-hand side of (I102p . that is, 


1 

a < - =■ 

86 016^10 


( 110 ) 


The statements (I108p and (HOOP correspond exactly to [SI Eq(4.5a) & Eq(4.5b)]) with 
V replaced by U. 

The proof of part (a) of Proposition [2] follows from the existence of the constants S , 
c, and a as established in the proof of part (a) of [ 8 j Proposition 4.1], It remains for 
us to show that, given the definition of r in (|46l) . it is indeed possible to choose such 
constants in such a way that the relations (I107I) - (I110D hold. 

With this in mind, set 

S ■= V, 

where tj is defined by (147|) . It follows from the definition of r] and the well known bound 
Nd < 5 d for the Besicovitch constant (cf. Remark E] p JIBP that (11071) is satisfied with 
5 — rj. We proceed with verifying (I108p and fllOOp . 

Regarding (j!08p . let g = (ui ■ f, u 2 • f + Uq) G Q. Also, let u := (ui, u 2 ) with Ui, u 2 G 
and let v := (iq, n 2 ) G S 1 . Furthermore, let w denote the vector w := iqui +u 2 u 2 . 
Since by the definition of £/, the vectors Ux and u 2 are orthogonal, it follows that 
w G S n_1 . Now observe that for any multi-index f3 such that |/?| < l, 


v • dpg = w • dpi. 

By the definition of s 0 = s(l; f, U + ), there exists s G S d_1 and 1 < k < l such that 


d k g 

v '^ (Xo) 


d k i 

w ' 


> So- 


As per usual, xo denotes here the centre of U. It follows that for any x G U + , we have 
that 


d k g 


dH' 


(d k i, , dH, A 

Y ' as* (x) 

> 

w ' ST (X »> 


w '- sr (x) J 


> s 0 - 


d k f d k f 


2 


( 111 ) 
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The same arguments as those used to prove 

so 


d k g , , 

v 'Sr (x) 


> 


can be employed to show that 
V x g U + . 


This proves (1T08D with 


c : = 


£o 

T 


We now turn our attention to (11091) . With g and u as above, first note that for any 
x G U + and for any multi-index /3 such that \/3\ = l , we have that 

llfyg(x) -fyg(xo) !h = II (ui • (fyf(x) -a / 3 f(x 0 )), u 2 • (a / 3 f(x) -^f(x 0 )))|| 2 . ( 112 ) 
Next, note that from the Cauchy-Schwarz inequality, we have that, for i — 1, 2, 

(u* • (fyf(x) - ^f(xo ))) 2 < ||fyf(x) - ^f(xo)||l. (113) 

On combining (1112)1 and (11 13)1 . we find that 

jlfyg(x) -d p g (xo )|| 2 < V^||^f(x) -^f(xo)|| 2 . 

Now, since f satisfies Assumption 1 , in view of (j45l) . we obtain that 

||a / 3 f(x) -a / 3 f(xo )|| 2 < M\/2 ||x — xo || 2 . 

Hence, for any x,y 6 U + we have that 

l| 0 / 9 g(x)- 3 ag(y )|| 2 < 2 M(||x-x 0 || 2 +||y-x 0 || 2 ). 

In view of (1461) . we also have that 

11 x x 0 11 2 T 11 y x 0 11 2 < 2-3 n+d+2 ■ r 


< 


9 on+d+2 _ V S 0 _ 

' 4- !0 7 3 n+d+2 dMl l + 2 (Z + l)! 


VSo 

2 • 10 7 dMl l+2 {l + 1)! ' 

The upshot is that 

J , S y X " %g(X) '^ g(y) " 2 £ 10 ^'^ + l )! (U4) 

for any multi-index j3 with \ j3\ — l. This proves (1109)1 with 

32 

a: ~W7d' 

which clearly satisfies (11101) . 

To prove part (b) of Proposition [21 we closely follow the proof of part (b) of [HI 
Proposition 4.1]. The new ingredient in our proof is the calculation of explicit constants 
at appropriate places. With this in mind, let g = (<7 i,<? 2 ) G Q and take B appearing 
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at the start of the proof of [8] Proposition 4.1(b)] to be U, so that B = |U. We claim 
that there exists a point y G B such that 


|g(y)ll 2 > T ■= 


r l s 0 


Al l (l + 1)! 


(115) 


To see that this is so, take v := (1,0) G S 1 . In view of (11081) . there exists a vector 
u G S d_1 and 1 < k < l such that 

> c := ^ V x G U . 


v • dkg fxf 


dk 9i 

du k ^ ’ 


du k ^ 


Thus, on applying [SJ Lemma 3.6] to the function g\ and the ball B, we obtain that 

^ l9l(x) " 9l(y)l - WTTy. = WTTy' 

This implies the existence of a point yell such that 

lls(y)ll 2 > My)l > r 

as claimed. Next, observe that for any w G and any x G U, 

dg - 

^ Vu 2 • £(x) 

Therefore, on using the Cauchy-Schwarz inequality, we obtain via (l45l) that 

< V2M. 


|i(x) 

< 72 

di 

-(X) 

chv ; 

2 

dw ^ 


(116) 


Now, observe that, in view of (j46l) . of the definition of r and of the fact that so < M, 
we have that 

t < r M. 

Consider the ball B' C B = U with radius r/(2M) < r/2 centred at y, where y 
satisfies (11151) . Take a vector v G S 1 orthogonal to g(y). I 11 view of (I108|) . there exists 
a vector u G and 1 < k < l such that 


d k g 

v ' SF (x) 


>c=-y VxGU 


Thus, on applying [8j Lemma 3.6] to the function x —y v ■ g(x) and the ball B ', we 
obtain that 


s o 


™p|vg(x)i> . 


r y 


(117) 


On the other hand, the upper bound (11161) implies that 

T 

sup ||g(x) - g(y)|| 2 < 


x6B' 


V2M = —• 
2 M ^2 


(118) 
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The upshot of (II 17|) and (II 18j> is that we are able to apply [8, Lemma 4.2] to the map 
g : B' — y M 2 to yield (ISTjl and thereby complete the proof of part (b) of Proposition [2] 
For ease of comparison, we point out that the quantities a, 5 and w appearing in the 
statement of [SJ Lemma 4.2] correspond to r, t/\J2 and the right-hand side of (1 11 711 
respectively. 

□ 
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